Page 1 of 8 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/785 , 632A 



DATE: 05/25/2001 
TIME: 19:48:04 



Input Set : A:\Pto.amc 

Output Set: C:\CRF3\05252001\I785632A.raw 




4 
5 
6 
7 
8 
10 
11 



<110> APPLICANT: Kim, Jin-Soo 
Kwon, Young Do 
Kim, Hyun-Won 
Ryu, Eun-Hyun 
Hwang, Moon-Sun 



<120> TITLE OF INVENTION: ZINC FINGER DOMAINS AND METHODS OF 
IDENTIFYING SAME 



13 <130> FILE REFERENCE: 12279-002001 

15 <140> CURRENT APPLICATION NUMBER: 09/785, 632A 

16 <141> CURRENT FILING DATE: 2001-02-16 
18 <160> NUMBER OF SEQ ID NOS : 166 

20 <170> SOFTWARE: FastSEQ for Windows Version 4.0 

22 <210> SEQ ID NO: 1 

23 <211> LENGTH: 10 

24 <212> TYPE: DNA 

25 <213> ORGANISM: HIV-1 

27 <400> SEQUENCE: 1 

28 gacatcgagc 10 

30 <210> SEQ ID NO: 2 

31 <211> LENGTH: 10 

32 <212> TYPE: DNA 

33 <213> ORGANISM: HIV-1 

35 <400> SEQUENCE: 2 

36 gcagctgctt 10 

38 <210> SEQ ID NO: 3 

39 <211> LENGTH: 10 

40 <212> TYPE: DNA 

41 <213> ORGANISM: HIV-1 

43 <400> SEQUENCE: 3 

44 gctggggact 10 

46 <210> SEQ ID NO: 4 

47 <211> LENGTH: 10 

48 <212> TYPE: DNA 

4 9 <213> ORGANISM: Homo sapiens 

51 <400> SEQUENCE: 4 . . 

52 agggtggagt 10 

54 <210> SEQ ID NO: 5 

55 <211> LENGTH: 10 

56 <212> TYPE: DNA 

57 <213> ORGANISM: Homo sapiens 

59 <400> SEQUENCE: 5 

60 gctgagacat 10 

62 <210> SEQ ID NO: 6 

63 <211> LENGTH: 47 

64 <212> TYPE: DNA 

65 <213> ORGANISM: Artificial Sequence 
67 <220> FEATURE: 
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68 <223> OTHER INFORMATION: optimal binding site 

70 <400> SEQUENCE: 6 

71 ccggcgtggg cggctgcgtg ggcgtgcgtg ggcggactgc gtgggcg 47 

73 <210> SEQ ID NO: 7 

74 <211> LENGTH: 4 7 

75 <212> TYPE: DNA 

76 <213> ORGANISM: Artificial Sequence 

78 <220> FEATURE: 

79 <223> OTHER INFORMATION: optimal binding site 

81 <400> SEQUENCE: 7 

82 tcgacgccca cgcagtccgc ccacgcacgc ccacgcagcc gcccacg 47 

84 <210> SEQ ID NO: 8 

85 <211> LENGTH: 49 

86 <212> TYPE: DNA 

87 <213> ORGANISM: HIV-1 

89 <400> SEQUENCE: 8 

90 ccggcgagcg ggcggtcgag cgggcgtgag cgggcggatc gagcgggcg 4 9 

92 <210> SEQ ID NO: 9 

93 <211> LENGTH: 49 

94 <212> TYPE: DNA 

95 <213> ORGANISM: HIV-1 

97 <400> SEQUENCE: 9 

98 tcgacgcccg ctcgatccgc ccgctcacgc ccgctcgacc gcccgctcg 49 

100 <210> SEQ ID NO: 10 

101 <211> LENGTH: 50 

102 <212> TYPE: DNA 

103 <213> ORGANISM: HIV-1 

105 <400> SEQUENCE: 10 

106 ccggctgctt gggcggctgc ttgggcgtgc ttgggcgggc tgcttgggcg 50 

108 <210> SEQ ID NO: 11 

109 <211> LENGTH: 50 

110 <212> TYPE: DNA 

111 <213> ORGANISM: HIV-1 

113 <400> SEQUENCE: 11 

114 tcgacgccca agcagcccgc ccaagcacgc ccaagcagcc gcccaagcag 50 

116 <210> SEQ ID NO: 12 

117 <211> LENGTH: 47 

118 <212> TYPE: DNA 

119 <213> ORGANISM: HIV-1 

121 <400> SEQUENCE: 12 

122 ccggactggg cgggggactg ggcgtgactg ggcggaggga ctgggcg 47 

124 <210> SEQ ID NO: 13 

125 <211> LENGTH: 47 

126 <212> TYPE: DNA 

127 <213> ORGANISM: HIV-1 

129 <400> SEQUENCE: 13 

130 tcgacgccca gtccctccgc ccagtcacgc ccagtccccc gcccagt 47 

132 <210> SEQ ID NO: 14 

133 <211> LENGTH: 47 
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134 <212> TYPE: DNA 

135 <213> ORGANISM: Homo sapiens 

137 <400> SEQUENCE: 14 

138 ccggagtggg cggtggagtg ggcgtgagtg ggcggatgga gtgggcg 47 

140 <210> SEQ ID NO: 15 

141 <211> LENGTH: 47 

142 <212> TYPE: DNA 

143 <213> ORGANISM: Homo sapiens 
145 <400> SEQUENCE: 15 

14 6 tcgacgccca ctccatccgc ccactcacgc ccactccacc gcccact 47 

148 <210> SEQ ID NO: 16 

149 <211> LENGTH: 48 

150 <212> TYPE: DNA 

151 <213> ORGANISM: Homo sapiens 

153 <400> SEQUENCE : 16 

154 ccggacatgg gcggagacat gggcgtacat gggcggaaga catgggcg 48 

156 <210> SEQ ID NO: 17 

157 <211> LENGTH: 48 

158 <212> TYPE: DNA 

159 <213> ORGANISM: Homo sapiens 

161 <400> SEQUENCE: 17 

162 tcgacgccca tgtcttccgc ccatgtacgc ccatgtctcc gcccatgt , 48 

164 <210> SEQ ID NO: 18 

165 <211> LENGTH: 120 

166 <212> TYPE: DNA 

167 <213> ORGANISM: Artificial Sequence 

169 <220> FEATURE 

170 <223> OTHER INFORMATION: plasmid sequence 



172 <221> NAME /KEY 

173 <222> LOCATION 
175 <400> SEQUENCE 



CDS 

(1) . . . (81) 
18 

176 aaa gag ggt ggg teg acc ttc egg act ggc cag gaa cgc cca gat ccg 48 

177 Lys Glu Gly Gly Ser Thr Phe Arg Thr Giy Gin Glu Arg Pro Asp Pro 

178 15 10 15 

180 egg gaa ttc aga tct act agt gcg gee get aag taagtaagac gtcgagctcg 101 

181 Arg Glu Phe Arg Ser Thr Ser Ala Ala Ala Lys 

182 20 25 

184 ccatcgcggt ggaagcttt 120 

186 <210> SEQ ID NO: 19 

187 <211> LENGTH: 27 

188 <212> TYPE: PRT 

189 <213> ORGANISM: Artificial Sequence 

191 <220> FEATURE: 

192 <223> OTHER INFORMATION: plasmid sequence 

194 <400> SEQUENCE: 19 

195 Lys Glu Gly Gly Ser Thr Phe Arg Thr Gly Gin Glu Arg Pro Asp Pro 

196 15 10 15 

197 Arg Glu Phe Arg Ser Thr Ser Ala Ala Ala Lys 

198 20 25 
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200 <210> SEQ ID NO: 20 

201 <211> LENGTH: 303 

202 <212> TYPE: DNA 

203 <213> ORGANISM: Artificial Sequence 

205 <220> FEATURE: 

206 <223> OTHER INFORMATION: plasmid sequence 



208 <221> NAME/KEY 

209 <222> LOCATION 
211 <400> SEQUENCE 



CDS 

(25) . . . (291) 
20 

212 gggtcgacct tccggactgg ccag gaa cgc cca tat get tgc cct gtc gag 51 

213 Glu Arg Pro Tyr Ala Cys Pro Val Glu 

214 15 

216 tec tgc gat cgc cgc ttt tct cgc teg gat gag ctt.acc cgc cat ate 99 

217 Ser Cys Asp Arg Arg Phe Ser . Arg Ser Asp Glu Leu Thr Arg His lie 

218 10 15 20 25 

220 cgc ate cac act ggc cag aag ccc ttc cag tgt cga ate tgc atg cgt 1-47 

221 Arg lie His Thr Gly Gin Lys Pro Phe Gin Cys Arg lie Cys Met Arg 

222 30 35 40 

224 aac ttc agt cgt agt gac cac ctt acc acc cac ate egg acc cac acc 195 

225 Asn Phe Ser Arg Ser Asp His Leu Thr Thr His lie Arg Thr His Thr 

226 45 50 55 

228 ggc gag aag cct ttt gec tgt gac att tgt ggg agg aag ttt gec agg 243 

229 Gly Glu Lys Pro Phe Ala Cys Asp He Cys Gly Arg Lys Phe Ala Arg 

230 60 65 70 

232 agt gat gaa cgc aag agg cat acc aaa ate cat tta aga cag aag gat 291 

233 Ser Asp Glu Arg Lys Arg His Thr Lys He His Leu Arg Gin Lys Asp 

234 75 80 85 

236 ccgcgggaat cc 303 

238 <210> SEQ ID NO: 21 

239 <211> LENGTH: 89 

240 <212> TYPE: PRT 

241 <213> ORGANISM: Artificial Sequence 

243 <220> FEATURE: 

244 <223> OTHER INFORMATION: plasmid sequence 

246 <400> SEQUENCE: 21 

247 Glu Arg Pro Tyr Ala Cys Pro Val Glu Ser Cys Asp Arg Arg Phe Ser 

248 15 10 15 

249 Arg Ser Asp Glu Leu Thr Arg His He Arg He His Thr Gly Gin Lys 

250 20 25 30 

251 Pro Phe Gin Cys Arg lie Cys Met Arg Asn Phe Ser Arg Ser Asp His 

252 35 40 45 

253 Leu Thr Thr His He Arg Thr His Thr Gly Glu Lys Pro Phe Ala Cys 

254 50 55 60 

255 Asp He Cys Gly Arg Lys Phe Ala Arg Ser Asp Glu Arg Lys Arg His 

256 65 70 75 80 

257 Thr Lys He His Leu Arg Gin Lys Asp 

258 85 

260 <210> SEQ ID NO: 22 

261 <211> LENGTH: 102 
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262 
263 
265 
266 
267 
269 
270 
271 
272 
274 
275 
276 
278 
279 
283 
284 
285 
286 
288 
289 
290 
291 
292 
293 
296 
297 
298 
299 
301 
302 
303 
305 
306 
307 
308 
310 
311 
312 
314 
315 
319 
320 
321 
322 
324 
325 
326 
327 
328 



<212> TYPE: DNA 
<213> ORGANISM: 
<220> FEATURE: 
<221> NAME /KEY 
<222> LOCATION 
<400> SEQUENCE 
acc ggg cag aaa 
Thr Gly Gin Lys 
1 

tgt ccc tea 
Cys Pro Ser 



ccc 
Pro 

egg 
Arg 



aac 
Asn 
20 



Homo sapiens 
CDS 

(1) . . . (102) 
22 

ccg tac aaa tgt 
Pro Tyr Lys Cys 
5 

ctt cga agg cat 
Leu Arg Arg His 



ccg 
Pro 

<210> SEQ ID NO: 
<211> LENGTH: 34 
<212> TYPE: PRT 
<213> ORGANISM: 
<400> SEQUENCE: 
Thr Gly Gin Lys 
1 

Cys Pro Ser Asn 

20 

Pro Arg 

<210> SEQ ID NO: 
<211> LENGTH: 10 
<212> TYPE: DNA 
<213> ORGANISM: 
<220> FEATURE: 
<221> NAME/KEY 
<222> LOCATION 
<400> SEQUENCE 



acc 
Thr 

1 
cac 
His 

ccg 
Pro 



ggg 

Gly 

age 
Ser 

egg 
Arg 



gag 
Glu 

tec 
Ser 



aag 
Lys 

aac 
Asn 
20 



23 



Homo sapiens 
23 

Pro Tyr Lys Cys 
5 

Leu Arg Arg His 



24 



Homo sapiens 
CDS 

(1) . . . (102) 
24 

cca tac aag tgt 
Pro Tyr Lys Cys 
5 

ttc aat aaa cac 
Phe Asn Lys His 



<210>. SEQ ID NO: 
<211> LENGTH: 34 
<212> TYPE: PRT 
<213> ORGANISM: 
<400> SEQUENCE: 
Thr Gly Glu Lys 
1 

His Ser Ser Asn 

20 



25 



Homo sapiens 
25 

Pro Tyr Lys Cys 
5 

Phe Asn Lys His 



aag caa tgt ggg aaa get ttt gga 
Lys Gin Cys Gly Lys Ala Phe Gly 

10 15 
gga agg act cac acc ggc gag aaa 
Gly Arg Thr His Thr Gly Glu Lys 
25 30 



48 



96 



102 



Lys Gin Cys Gly Lys Ala Phe Gly 

10 15 
Gly Arg Thr His Thr Gly Glu Lys 
25 30 



aag gag tgt ggg aaa gec ttc aac 

Lys Glu Cys Gly Lys Ala Phe Asn 

10 15 

cac aga ate cac acc ggc gaa aag 

His Arg He His Thr Gly Glu Lys 

25 30 



48 



96 



102 



Lys Glu Cys Gly Lys Ala Phe Asn 

10 15 
His Arg He His Thr Gly Glu Lys 
25 30 



y- 

Pleas Note: 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to nsure that a corresponding explanation is presented in the <220> to 
<223> fields f each sequ nee which presents at least one n or Xaa. 
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L: 


1007 


M: 


341 


W: 


: (46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


: 68 


L: 


1009 


M: 

It Jj • 


341 


W: 


: (46) 


"n " 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 

j- *— ' ■ i 


: 68 


L: 


1033 


M: 

j. 4 * 


341 


W: 


; (46) 


H — (I 


or 


"Xaa" 


used , 


for 


SEO 


ID# 


: 69 


L: 


1035 

■J- V V-/ 


M * 


341 


W: 


: (46) 


" n 11 


or 


"Xaa" 


used , 


for 


SEQ 


ID# 

j- *j f ii 


: 69 


L: 


1059 


M: 


341 


W: 


: (46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


: 70 


L: 


1061 

A w ^J' J* 


M • 


341 


W: 


: (46) 


"n " 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


: 70 


L: 


i085 


M • 


341 


W: 


: (46) 


"n " 


or 


"Xaa" 


used , 


for 


SEQ 


ID# 

j- *— ' ii 


: 71 


L : 


1087 

J— V \^ J 


M : 

*- 4 * 


341 


W: 


: (46) 


"n " 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


: 71 


L : 


1115 

J- J*, 


M : 

L 4 ■ 


341 


W: 


: (46) 


"n " 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


: 72 


L: 


1117 


M : 

4 J ■ 


341 


W: 


; (46) 


11 n " 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


: 72 


L: 


1141 


M : 


341 


W: 


: (46) 


"n " 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


: 73 


L: 


1143 


M : 


341 


W: 


: (46) 


"n " 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


: 73 


L: 


1167 


M: 


341 


W: 


: ' (46) 


"n " 


or 


"Xaa" 


used. 


for 


SEQ 


ID# 

*— ^ ii 


: 74 


L : 


1169 


M : 


341 


W: 


: (46) 


"n 11 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


: 74 


L : 


1197 

-l j* r 


M • 

i. 4 * 


341 


W: 


. (46) 


"n " 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 

•X- *^ 1 1 


: 75 


L • 


1199 

-i- -1- ~S 


M • 

L J t 


341 


W ' 


(46) 


it „ ri 

4 4 


or 


"Xaa " 


used , 


for 


SEQ 


ID# 


: 75 




1223 




341 


w ■ 


f 4 6 ) 


if j-. tl 


or 

V-/ -I- 


"Xaa" 


used. 


for 


SEQ 


ID# 


: 76 


L • 


1225 


M * 

Li* 


341 


w ■ 


f 46) 


IT — H 

1 4 


or 

N-hT -X^ 


"Xaa" 


used . 


for 

-L*. 


SEQ 


ID# 


: 76 


L • 
j-i • 


1281 
j. w j- 


Lit 


341 


W ' 


f 46) 


i? — it 

1 4 


or 


"Xaa" 


used . 


for 


SEQ 


ID# 


: 77 


L • 


1283 


Lit 


341 


W • 


f 46) 


u ~ n 

4 4 


1 

or 


"Xaa" 


used . 


for 


SEQ 


ID# 


: 77 


L * 


1305 


L 1 i 


341 


w * 


(46) 


it j-. it 

4 4 


or 


"Xaa" 


used , 


for 


SEQ 


ID# 

*— ' ii 


: 78 


T. • 
xj * 


134 4 

X T T 


Lit 


341 

T X 




( 46) 


it — it 


or 


"Xaa" 


used . 


for 


SEO 


ID# 

j- ii 


: 81 

• v JW 


T, • 
xj * 


1360 
j- \j w 


Lit 


341 

•»J T X 




(46) 


it — it 

4 4 


or 


"Xaa" 


used . 


for 

j_ v j_ 


SEO 


ID# 


: 82 




1383 

X «-V \J 


L 1 « 


341 


w ■ 


(46) 


it ^ it 

4 4 


or 


"Xaa" 


used . 


for 


SEQ 


ID# 

*— ' ii 


: 83 


T. • 
j_i * 


J. X \J 


Lit 


341 


w • 

V 1 i 


* (46) 


" n 11 

1 X 


or 

u X 


"Xaa" 


used , 


for 

-L*. X^/ J_ 


SEO 


ID# 


: 85 




1 4 4 8 

1 7 1 U 




341 

*J T X 


w ■ 


1 (46) 


It ^ It 

1 1 


o r 

w X 


"Xaa" 


11 9PH 


for 


SEO 

o i_j v 


ID# 


• 88 


T, • 


14 63 

X T \J sJ 




341 


W ' 

V V i 


1 (46) 


It p It 
1 1 


or 

W X 




<J / 


for 


SEO 


ID# 


- 89 


T . • 
u • 


1700 


Lit 


341 

*j n x 




1 (4 6) 


II It 

1 1 


or 

W X 


"Xaa" 

4 \ V-4. C4. 


n «?Pfi . 

O ^ J 


for 


SEO 


ID# 


• 108 


T, • 
xj • 


1701 


Lit 


341 




■ (4 6) 


II — II 

1 1 


or 

W X 


"Xaa" 

<VC4 


n cjpd . 

V-l O \^ V^i ^ 


for 


SEO 

u 


ID# 


- 108 


T , • 
xj » 


X / X \J 




341 
j m i 


w ■ 

VV i 


(46) 


II It 


o r 

^ X 


"Xaa" 


1 1 qprj 

LA O >w V-4. f 


for 

_1_ v.x -i* 


SEO 

O Xj v 


ID# 

J- LS IJ 


■ 109 


xj * 


1717 

X / X f 




341 

-J T X 


w ■ 

VK i 


' (4 6) 




o r 


"Xaa" 

£ \ C4 


n qpH . 

>w> \4 ^ 


for 

X. / ^ 


SEO 


ID# 


• 109 


T, • 
j_i • 


2 334 


M ■ 

Lit 


341 


W ' 


■ (46) 


M p II 


o r 


"Xaa " 

C X C4 d 


used . 

KA O f 


for 


SEO 


ID# 

-4- I—' I) 


• 150 

• j_ \j 




2336 


Lit 


341 


W ' 


• (46) 


II _ II 

1 X 


o r 


"Xaa" 

£ i C4 


used . 

\mJL k_/ S^-4 f 


for 


SEO 


ID# 

-L. u ll 


: 150 


T, • 

XJ • 


2360 
£~ w» \j \j 


M ■ 

Lit 


341 

~J ~ -L 


W ' 


• (46) 


It II 


or 


"Xaa" 


used . 


for 


SEO 


ID# 

j- *— ' ii 


: 151 


XJ • 


2 362 


Lit 


341 

»J T X 




1 (4 6) 


II _ II 

1 1 


o r 

W X 


"Xaa" 

4 4 U C4 


ii qpH . 


for 


SEO 


ID# 


* 151 


T. • 
XJ • 


238 6 


Lit 


341 

T X 


w ■ 


' (46) 


II _ II 

1 1 


or 

W X 


"Xaa" 

£\ L>4 (-4 


used , 

LhI >W ^t. ^ 


for 


SEO 


ID# 


: 152 


T, • 
xj * 


2388 


Lit 


34 1 

*J T X 




1 (46) 


tl — II 

i L 


o r 

^ X 


"Xaa " 


nqpH , 

U O ^ \A , 


for 


SEO 


ID# 

-X Lm/ Tl 


: 152 


L: 


2412 


M: 


341 


W; 


: (46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


: 153 


L: 


2414 


M: 


341 


W: 


: (46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


: 153 


L: 


2438 


M: 


341 


W: 


: (46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


: 154 


L: 


2440 


M: 


341 


W: 


: (46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


: 154 


L : 


2464 


M: 


341 


W: 


: (46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


:155 


L: 


2466 


M: 


341 


W: 


: (46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


: 155 


L: 


2490 


M: 


341 


W: 


: (46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


: 156 


L: 


2492 


M: 


341 


W: 


: (46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


: 156 


L: 


2516 


M: 


341 


W: 


; (46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


: 157 


L: 


2518 


M: 


341 


W: 


: (46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


: 157 


L: 


2542 


M: 


341 


W: 


: (46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


: 158 
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L: 


2544 


M: 


341 


W 

¥ ¥ 


: (46) 


»n" 


or 


"Xaa" 


used , 


for 


SEO 


ID# 


: 158 


L : 


2568 

4mm >J \J \y 


M * 


341 


w 


■ (46) 


"n " 


or 


"Xaa" 


used , 


for 


SEO 


ID# 


: 159 


L : 


2570 
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